Skip to content

Conversation

@copybara-service
Copy link

Feat: Add cost analysis of Pallas kernels using LLO tracing
This change introduces the capability to perform cost analysis of Pallas kernels within XProf. Pallas kernels are represented as "custom-call" HLOs, and this change enables XProf to estimate their performance characteristics (flops, IOPS, and DMA bandwidth) by analyzing Low-Level Optimized (LLO) instruction traces.

This change introduces the capability to perform cost analysis of Pallas kernels within XProf. Pallas kernels are represented as "custom-call" HLOs, and this change enables XProf to estimate their performance characteristics (flops, IOPS, and DMA bandwidth) by analyzing Low-Level Optimized (LLO) instruction traces.

PiperOrigin-RevId: 831937474
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

0 participants